Push and Pull: Iterative grouping of media
نویسندگان
چکیده
While many techniques use the traditional approach of time consuming groundtruthing large amounts of data [1, 2], this is increasingly infeasible as dataset size and complexity increase. Instead we propose a solution that allows the user to select media that semantically belongs to the same class and use machine learning to “pull” this and other related content together. We use real data harvested from the internet and propose an approach capable of incrementally clustering similar material using the manual identification of a few true positive and false positive examples. In order to provide both scalability and incremental learning, the approach needs to be efficient. We combine two popular data mining tools developed for the text analysis domain to efficiently compute distances between high dimensional representations and dynamically augment the representation with new compound visual words to form an image signature. These tools are applied to selected true and false positive examples of the media and rules learnt, that are applied to the full corpus of material. The media is then formed into groups of same class media using a greedy clustering approach. An image signature is constructed for each input sample; this is similar to a bag-of-words (BoW) histogram representation, and provides a compact, discrete representation of the input sample, as shown in Figure 1. In order to form the similarity between the image signatures, a data min-
منابع مشابه
An examination of the effects of push and pull factors on Iranian national parks: Boujagh National Park, Iran
This article analyses the push and pull factors that bring visitors to the Iranian national parks. The study used a structured questionnaire to collect data on these factors and the socio-demographic profile of the visitors. Survey conducted in Boujagh National Park, an area of 3177 hectares located in the north of the Iran, produced 400 questionnaires. The factor analysis identified four push ...
متن کاملبررسی نظام تولید و توزیع فرشدستباف به منظور ارائه راهکار مناسب با تکیه بر تبدیل نظام فشاری (Push) به نظام کششی (Pull)
This article is trying to study production and distribution system based on providing value chain with the aim of identifying production & distribution system of hand-made carpet firstly; and studying the feasibility of changing from the push system to the pull system regarding the viewpoints of the elite and expert, secondly. In order to achieve this goal, the descriptive method of research ha...
متن کاملModeling of Hybrid Production Systems with Constant WIP and Unreliable Equipment
Material flow in production systems can be controlled by a purely push-pull (just-in-time), or by a hybrid push-pull control mechanism. One type of push-pull production control can be implemented by controlling only the last stage during part withdrawals to trigger the production at the first stage. While the final stage is operated according to a pull mechanism, intermediate stages are operate...
متن کاملPALMS : Reliable P2P Live Media Streaming
In recent year, Peer-to-Peer (P2P) approach for media streaming has been studied extensively. In comparison to on-demand media streaming, P2P live media streaming faces a much stringent time constraint. In order to improve the performance metrics, such as startup delay, source-to-end delay, and playback continuity, we present PALMS, a P2P approach for live media streaming where node employs gos...
متن کاملMultiple Target Tracking in Wireless Sensor Networks Based on Sensor Grouping and Hybrid Iterative-Heuristic Optimization
A novel hybrid method for tracking multiple indistinguishable maneuvering targets using a wireless sensor network is introduced in this paper. The problem of tracking the location of targets is formulated as a Maximum Likelihood Estimation. We propose a hybrid optimization method, which consists of an iterative and a heuristic search method, for finding the location of targets simultaneously. T...
متن کامل